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ABSTRACT 

We have compared the transcriptomes of cultured 
procyclic Trypanosoma brucei cells in early and late 
logarithmic phases and found that ^200 mRNAs were 
differentially regulated. In late log phase cells, the 
most upregulated mRNA encoded the nucleobase 
transporter NT8. The 3' untranslated region (UTR) of 
NT8 contains a short stem-loop c/s-element that is 
necessary for the regulation of NT8 expression in re- 
sponse to external purine levels. When placed in the 
3 -UTR of an unregulated transcript, the c/s-element 
is sufficient to confer regulation in response to 
purines. To our knowledge, this is the first example of 
a discrete RNA element that can autonomously reg- 
ulate gene expression in trypanosomes in response 
to an external factor and reveals an unprecedented 
purine-dependent signaling pathway that controls 
gene expression in eukaryotes. 

INTRODUCTION 

Coordinating expression of functionally-related genes in re- 
sponse to environmental signals is essential for a cell. Recent 
genome-wide studies have shown that the global regulation 
of gene expression depends largely on post-transcriptional 
events (1). Key players in post-transcriptional regulation 
are trans-acting factors such as ribonucleic acid (RNA)- 
binding proteins, non-coding RNAs and metabolites and 
ds-acting elements, which are regulatory sequences usually 
found within the untranslated regions (UTRs) of messen- 
ger RNAs (mRNAs). The interaction between trans- and 
ds-acting factors results in the coordinated regulation of 
subsets of mRNAs encoding functionally-related proteins, 
which are known as post-transcriptional operons or regu- 
lons (1). 



Post-transcriptional regulation is of unusual importance 
in trypanosomatids. The best characterized members of this 
group of protozoans are parasites that cause severe diseases 
in humans and cattle, such as Trypanosoma brucei, Try- 
panosoma cruzi and Leishmania ssp. (2). They exhibit com- 
plex life cycles alternating between insects and mammals, 
and therefore regulate their gene expression programs in or- 
der to adapt to the changes in temperature, pH, nutrients 
and defenses they find within one or the other host (3). Evi- 
dence so far indicates that there is no selective regulation of 
transcription of protein coding genes by RNA polymerase 
II. Instead, trypanosomatids rely mainly on mRNA pro- 
cessing, turnover, translation and localization to remodel 
the proteome during the life cycle (4-6). This is thought to 
be achieved through the orchestrated interaction of RNA- 
binding proteins and c/s-acting RNA elements, and indeed 
several lines of evidence strongly support a model in which 
RNA operons operate in these organisms (4,7). In spite of 
the dependence on post-transcriptional mechanisms, very 
few RNA-binding proteins have been characterized in de- 
tail in trypanosomatids, and in even fewer instances pre- 
cisely defined c/s-acting elements have been unequivocally 
identified as being necessary and sufficient to confer regula- 
tion (5,8). These include a 26-mer stem-loop within T bru- 
cei EP1 procyclin that causes RNA instability in the blood- 
stream form (9), a 25-nt glycerol-responsive element in the 
T brucei GPEET procyclin that controls expression by glu- 
cose and during development in the insect vector (10), a 50- 
nt AU-rich region within mucin genes in T cruzi that affects 
mRNA degradation and translation (11) and a 34-nt ele- 
ment that represses expression in proliferative T brucei slen- 
der bloodstream forms and activates it in quiescent stumpy 
bloodstream parasites (12). These c/s-elements have been 
shown to be necessary but none has been shown to be func- 
tionally sufficient when placed in a different mRNA con- 
text. In Leishmania major, a retroposon-derived sequence 
of ~600 nt located within the 3' -UTR of many transcripts, 
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microtubule-based process 
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Systematic name Description of gene product 
Upregulated in late log 



log2 fold-change 



2 4 6 

■ log (P-value) 



Tb11.02.1105 nucleobase transporter NT8 3.3 

Tb927.5.1570 zinc-finger protein ZC3H 12 2.4 

Tb927.7.7500 thymine-7-hydroxylase TPL7 2.2 

Tb927.5.300 thymine-7-hydroxylase TPL5 1.8 

Tb927.5.1360 nucleoside 2-deoxyribosyltransferase NDRT 1.7 

Tb927.8.7740 amino acid transporter, putative 1.6 

Tb927.7.3160 Cytoplasmic dynein 1 heavy chain (DYNC1H1), putative 1.5 

Tb927.4.4730 amino acid transporter, AATP11 1.4 

Tb927.4.560 Cytoplasmic dynein 2 heavy chain (DYNC2H1), putative 1.4 

Tb927.8.2370 DCL1 1.3 

Down regulated in late log 

Tb927.7.3260 ESAG7 -5.1 

Tb927.1 0.1 0210 PAG4 -4.4 

Tb927.1 0.1 0230 PAG5 -3.8 

Tb927.8.480 phosphatidic acid phosphatase protein, putative -3.7 

Tb927.1 0.1 0220 PAG2 -3.5 

Tb927.5.440 trans-sialidase, putative -2.8 

Tb927.1 0.1 0240 PAG1 -2.6 

Tb1 1.55.0028 ESAG3 -2.6 

Tb927.8.3620 ESAG10 -2.3 

Tb927.6.460 PAG3 -2.1 

Tb09.21 1.4980 LRRP -1.9 

Tb927.7.170 ESAG9 -1.9 

Tb1 1.01. 0725 cation transporter -1.9 

Tb927.3.3431 rRNA5.8S(M3) -1.8 

Tb927.3.3435 rRNA large subunit delta (M2) -1 .6 

Tb927.8.2861 SRP RNA, 7SL -1.6 

Tb927.7.190 thimet oligopeptidase A, putative -1.6 

Tb927.3.3434 rRNA large subunit beta -1 .5 

Tb927.5.4170 histone H4, putative -1.5 

Tb927.7.1320 HSP10 -1.5 

Tb1 1.02. 5280 glycerol-3-phosphate dehydrogenase, mitochondrial -1.5 

Tb927.2.2020 ESAG3, putative -1.5 

Tb1 1.01. 3805 microtubule-associated protein corset-associated protein 15 -1.5 

Tb927.3.3432 rRNA large subunit alpha -1 .4 

Tb11.01.0730 cation transporter, putative -1.4 

Tb927.3.590 adenosine transporter, putative -1 .4 

Tb927.10.5840 translation elongation factor 1 -beta -1.4 

Tb927.7.4170 fatty acid elongase, putative -1.4 

Tb927. 7.4570 inosine-guanine nucleoside hydrolase -1.4 

Tb927. 1.3830 glucose-6-phosphate isomerase, glycosomal -1.3 

Tb09.211.0540 fructose-1,6-bisphosphatase -1.3 

Tb927.7.1120 trypanothione/tryparedoxin dependent peroxidase 1, cytosolic -1.3 

Tb11.01.6800 1-acyl-sn-glycerol-3-phosphate acyltransferase, putative -1.3 

Tb09.244.2070 small GTPase, putative -1.3 

Tb11. 42.0007 histone H1, putative -1.3 

Tb927.10. 14160 aquaporin 3, putative -1.3 

Tb09.1 60.371 0 proliferative cell nuclear antigen (PCNA), putative -1 .3 



Figure 1. The transcriptome of procyclic T. brucei is remodeled during the growth curve. (A) Typical growth curve of procyclic T. brucei cells. RNA samples 
were obtained from cells in early and late logarithmic phases, and subjected to high-throughput sequencing. (B) List of mRNAs whose abundance is altered 
at least ± 2.5-fold (log2 = ±1.3) in late log phase. Transcripts encoding hypothetical proteins are not shown. A complete list can be found in Supplementary 
Table SI. (C) Gene Ontology (GO) analysis of mRNAs upregulated in late log phase. Only significantly enriched categories (P < 0.01) are shown. 



named SIDER2, has been shown to promote mRNA insta- 
bility when placed downstream of a reporter gene (1 3, 14). A 
79-nt signature was shown to be essential for mRNA degra- 
dation (15), but whether these 79 nt are sufficient indepen- 
dently is not known at present. In Leishmania mexicana, a 
10-nt element termed paraflagellar rod (PFR) regulatory el- 
ement (PRE) was found to destabilize a PFR chimeric tran- 
script containing a heterologous 3-UTR transcribed from 
an episome (16). However, since sequences corresponding 
to both the 5 / -UTR and the open-reading frame of the PFR 
gene were still present in the episomal expression construct, 
it is still unknown whether the PRE could promote regu- 
lation when placed in a different genetic context. Lastly, an 
octamer sequence was identified in Crithidia fasciculata that 
could confer periodic accumulation on a heterologous tran- 



script, but only when multiple copies of the element were 
inserted within the 5 / -UTR of a reporter gene (17). 

In a search to identify novel cis- and ^raws-acting fac- 
tors that regulate gene expression in T. brucei, we com- 
pared the transcriptome of procyclic T. brucei cells in early 
and late logarithmic phases using high-throughput sequenc- 
ing (RNAseq). The most upregulated transcript in late log 
phase encodes the purine transporter NT8. We have identi- 
fied a short RNA element within the S'-UTR of NT8 that 
is necessary for purine-dependent repression of expression 
in early log phase, and found that it is also sufficient to con- 
fer regulated expression when placed in a different genetic 
context. 
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early log late log 



NT8 
7SL 



15 ± 1 min 55 ± 3 min 



ZC3H12 
7SL 



7±2min 43 ± 7 min 



Tb927.10. 
1560 

7SL 



32 ± 5 min 11 ± 2 min 



Figure 2. mRNA turnover is regulated during the growth curve. Half-lives 
of two transcripts upregulated in late log phase (NT8 and Z3CH12) and 
one downregulated in late log phase (Tb927.10.1560) were measured as 
previously described (20) from data obtained from at least three indepen- 
dent experiments. Representative northern blots are shown in each case. 
The signal recognition particle 7SL RNA was used for normalization. 
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Figure 3. Extracellular purine levels regulate mRNA abundance. (A) A 
culture in late log phase was split in two subcultures, and 100-|jlM guano- 
sine was added to one of them. RNA samples were obtained after 2-h or 
4-h incubation at 27° C, and the abundance of NT8 and Tb927. 10.1 560 
was analyzed by northern blot using specific probes. The signal recogni- 
tion particle 7SL RNA was used for normalization. (B) NT8 levels were 
also determined at shorter incubation times after the addition of guano- 
sine. The addition of 100-|jlM guanosine to early log cultures prevented the 
accumulation of NT8 later on in late log phase (leftmost lane). 



MATERIALS AND METHODS 

Trypanosome culture 

T. brucei Lister 427 procyclic cells were grown at 27° C in 
SDM-79 medium (18) containing 10% fetal bovine serum. 
Purine-deprived SDM-79 was supplemented with 10% dia- 
lyzed fetal bovine serum (Invitrogen). When used, guano- 
sine was added at a concentration of 100 |xM. Trans- 
genic trypanosomes were obtained following standard pro- 
cedures (19). 

RNA and transcriptome analysis 

Total RNA was extracted using peqGOLD Trifast (pe- 
qlab). Northern hybridizations, mRNA degradation exper- 
iments and quantitative reverse transcriptase-polymerase 
chain reaction (RT-PCR) were done as previously described 
(20,21). Complementary deoxyribonucleic acid libraries for 
high-throughput sequencing were obtained from two bi- 
ological replicates using the Illumina mRNA-seq kit ac- 
cording to manufacturer's instructions. Libraries were se- 
quenced on an Illumina platform at EASIH (Eastern Se- 
quencing and Informatics Hub, Cambridge, UK) and the 
reads of 36 nucleotides in length were aligned to the T. 
brucei TREU 927 genome v4 using Bowtie (22,23). Ma- 
nipulation and processing of the aligned tags were done 
using SAMtools (24). Aligned sequence tags were visu- 
alized using Artemis (25). The number of hits per open 
reading frame (ORF) was determined using HT-Seq soft- 
ware (http://www-huber.embl.de/users/anders/HTSeq/doc/ 
index.html). The number of reads mapping to each fea- 
ture in each condition was compared to early log samples 
using the EdgeR package (26). Gene Ontology (GO) as- 
signments were done using GOStat (27). RNA secondary 
structure predictions were obtained using RNAfold (http: 
//rna . tbi . univie. ac . at/ cgi- bin/RNAfold . cgi) . 

Lucif erase constructs and assays 

Luciferase expression constructs were derived from 
pGR108 (20) (Supplementary Figure S1A). To obtain the 
NT8 3 / -UTR, a fragment spanning the complete intergenic 
region between genes Tb917.11.3620 and Tb917.11.3630 
was amplified by PCR (Supplementary Figure SI A). Intact 
and deleted versions of the NT8 3 / UTR were cloned as 
BamH \-Hpa I fragments into pGR108 digested in the same 
way. NT8 5'UTR was cloned as a blunt-Tfzwd III fragment 
into pGR108 digested with Sma I and Hind III. Plasmids 
were digested with Not I and stably integrated in the 
tubulin locus by electroporation as described (20). A list of 
the plasmids used in this work is shown in Supplementary 
Figure SIB). 

For luciferase assays, 2^xl0 7 cells were washed in 
phosphate-buffered saline and frozen at — 80°C until use. 
Cell pellets were thawed, resuspended in lysis buffer (100- 
mM potassium phosphate, pH 7.8, 2 mM-dithiothreitol 
(DTT), 0.1% Triton X-100, 10% glycerol) at a concentra- 
tion of 0.1 ml per 1 x 10 7 cells, vortexed briefly, incubated 5 
min on ice and centrifuged at 16 OOOg for 10 min at 4°C. Ten 
to twenty microliters were used per assay using Promega's 
Luciferase Assay System in a Berthold FB 12 luminometer 
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Systematic name Description of gene product log2 fold-change 



Upregulated + guanosine 

Tb927.1 0.1 0230 
Tb927.10.10210 
Tb927.1 0.1 0220 
Tb927.1 0.1 0240 
Tb927.3.3435 
Tb927.8.2861 
Tb927.10.7160 
Tb927.3.3432 
Tb927.1 0.1 1990 
Tb927.3.3429 
Tb09.21 1.4870 
Tb927.10.10010 
Tb927. 10.6630 
Tb927.3.3434 
Tb927.8.4450 
Tb927.1 0.1 2330 
Tb927.4.2310 
Tb11.01.6600 
Tb927.7.270 
Tb927.3.2750 
Tb09.244.2790 
Tb11. 02.0100 
Tb927.8.2000 
Tb927.2.1443 
Tb927.7.190 
Tb927.4.2630 



PAG 5 4.5 

PAG4 3.2 

PAG2 3.0 

PAG1 3.0 

rRNA large subunit delta (M2) 3.0 

SRP RNA, 7SL 2.9 

PAG1 2.8 

rRNA large subunit alpha 1.9 

RNA-binding protein 1.7 

rRNA small subunit 1.7 

RNA 3'-terminal phosphate cyclase-like protein 1 .7 

60S acidic ribosomal protein, putative 1.7 

HEL64 1.7 

rRNA large subunit beta 1 .6 

RBP11 1.5 

ZC3H34 1.5 

asparaginyl-tRNA synthetase, putative 1.5 

PUF7 1.5 

ribosome biogenesis protein, putative 1.4 

hypothetical protein, conserved 1.4 

rRNA processing protein, putative 1.4 

carboxypeptidase, putative 1.3 

NCP1 (peptidyl-prolyl cis-trans isomerase) 1.3 

5.8S(M3) ribosomal RNA 1.3 

thimet oligopeptidase A, putative 1.3 

ATP-dependent DEAD/H RNA helicase, putative 1 .3 



nucleic acid binding 
ATP-dependent helicase activity 
RNA binding 



2 4 6 

- log (P-value) 



Downregulated + guanosine 

Tb11. 02.1105 NT8 -2.4 

Tb927. 3.4500 cFUM, cytosolic fumarase -2.2 

Tb11.01.7190 NADH-cytochrome b5 reductase -2.0 

Tb927. 10.2560 mMDH, mitochondrial malate dehydrogenase -1.7 

Tb11 .01 .4560 variant surface glycoprotein (VSG)-related -1 .6 

Tb927.7.6220 protein kinase, putative -1.4 

Tb927.5.1360 NDRT, nucleoside 2-deoxyribosyltransferase -1.4 

Tb927. 10.2010 hexokinase -1.4 

Tb09.21 1.0350 adenylate kinase -1.3 

Tb927. 7.4390 threonine synthase, putative -1.3 

Tb11. 01.7880 CAP1 7 (corset-associated protein 17) -1.3 

Tb927. 1.710 PGK-B (phosphoglycerate kinase B) -1.3 




flagellin-based flagellum 

carbohydrate metabolic process 
flagellar motility 
alcohol metabolic process 



2 4 6 

- log (P-value) 



Figure 4. Effect of guanosine addition on the transcriptome of late log procyclic trypanosomes. (A) A list of mRNAs whose abundance is altered at least 
± 2.5-fold (log2 = ±1.3) upon addition of guanosine to late log cells. Transcripts encoding hypothetical proteins are not shown. A complete list can be 
found in Supplementary Table S2. (B) Gene Ontology (GO) analysis of regulated mRNAs. Only significantly enriched categories (P < 0.01) are shown. 



according to manufacturer's instructions. Protein concen- 
tration was determined using Bradford reagent (BioRad). 

RESULTS 

Transcriptome analysis of procyclic trypanosomes in early 
and late logarithmic phases 

To analyze how the transcriptome of procyclic try- 
panosomes is remodeled during the growth cycle in culture, 
we prepared RNA samples from early and late logarithmic 
phase cells (Figure 1A) and compared them using deep se- 
quencing. There were ~200 transcripts with at least 2-fold 
altered abundance in late logarithmic phase, of which 80 
were upregulated and 117 downregulated. A list of the top 
(> 2. 5 -fold) regulated transcripts with known or predicted 
functions is shown in Figure IB. The complete list of all 
regulated transcripts, including hypothetical proteins, can 
be found in Supplementary Table S 1 . 

GO was used to categorize regulated transcripts accord- 
ing to their biological process, cellular component and 
molecular function. Significantly enriched GO terms (P 
< 0.01) for upregulated transcripts are shown in Figure 
1C. There was a significant enrichment for transcripts en- 
coding motor proteins such as microtubule components 
and dynein complexes. On the other hand, there was no 



significant enrichment for any GO category in the down- 
regulated mRNA set. However, many RNA-polymerase I 
transcripts (i.e. procyclic-associated genes and ribosomal 
RNAs) decreased in abundance in late logarithmic phase, a 
phenomenon also described in yeast cells entering station- 
ary phase (28,29). 

mRNA half-life is regulated during the growth cycle 

To assess whether the observed changes in mRNA abun- 
dance were due to alterations in RNA degradation rates, 
we measured the half-life of three transcripts, two upregu- 
lated in late log phase, NT8 (Tbl 1.02.1 105) and ZC3H12 
(Tb927.5.1570) (Figure IB), and one transcript downregu- 
lated in late log phase, Tb927.10.1560 (Supplementary Ta- 
ble SI). As shown in Figure 2, the half-lives of both upregu- 
lated transcripts increased, whereas it decreased in the case 
of the downregulated mRNA. These results indicate that 
the changes in mRNA steady-state levels result from differ- 
ent half-lives in early and late log phase cells. 

External purine concentration regulates mRNA abundance 

The most upregulated mRNA in late log phase was NT8 
(Figure IB) that encodes a permease that transports differ- 
ent purines with high affinity (30,3 1). We wondered whether 
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lated. A list of the most regulated (>2. 5-fold) transcripts 
with known or predicted functions is shown in Figure 4A. 
The complete list of regulated transcripts, including hy- 
pothetical proteins, can be found in Supplementary Table 
S2. GO analysis (Figure 4B) showed that RNA-binding 
proteins, especially helicases, were significantly upregulated 
in guanosine-supplemented medium. RNA-polymerase I- 
derived transcripts also increased in abundance in the pres- 
ence of guanosine. On the other hand, transcripts encoding 
enzymes involved in carbohydrate metabolism and flagel- 
lar motility were significantly enriched in the downregulated 
set. 



B 




EP1-ACT EP1-NT8 NT8-NT8 NT8-ACT 



Figure 5. The 3'-UTR of NT8 confers regulation to a luciferase reporter 
gene. Stable cell lines were generated that expressed luciferase under the 
control of NT8 5'- and/or 3'-UTRs. A cell line expressing luciferase fused 
to procyclin EP1 5'-UTR and actin 3'-UTR was used as a control. Lu- 
ciferase mRNA (A) or activity (B) levels were normalized to those of the 
control cell line in early log phase, which were set at 100%. Transcript lev- 
els were determined by quantitative RT-PCR. The actual values of relative 
luciferase activity are indicated above each bar for convenience. 



the addition of purines in late log phase could reverse the 
increased abundance of NT8 mRNA. Indeed, when guano- 
sine was added to late log cultures, the abundance of NT8 
decreased to levels close to those observed in early log phase 
(Figure 3A): NT8 levels increased 9.1 ± 1.7-fold (n = 3) 
when entering in late log phase and decreased back 8.5 
± 1.3-fold upon addition of the nucleoside. A similar ef- 
fect was observed when another purine, hypoxanthine, but 
not a pyrimidine, thymidine, was used (data not shown). 
Moreover, the mRNA Tb927.10.1560, which is downreg- 
ulated 1.8 ± 0.5-fold in late log phase, increased in abun- 
dance 2.4 ± 0.4-fold when purines were added to the cul- 
ture (Figure 3A). Remarkably, the response to guanosine 
was very fast. NT8 mRNA levels were reduced 2.4 ± 0.6- 
fold 15 min after addition of guanosine to the medium 
(Figure 3B). The addition of additional guanosine to early 
log cultures prevented the accumulation of NT8 later on 
in late log phase (Figure 3B, leftmost lane). NT8 was ap- 
parently the only purine transporter regulated during the 
growth cycle (Supplementary Table SI). We checked the ex- 
pression of a different purine transporter, NT 10 (32), and 
found that its expression was not altered during the growth 
curve (data not shown). The effect of purine addition to 
late log cultures was analyzed in more detail using RNAseq 
transcriptome analysis. There were 138 transcripts with at 
least 2-fold altered abundance in guanosine-supplemented 
medium, of which 95 were upregulated and 43 downregu- 



The 3'-UTR of NT8 confers regulation during the growth cy- 
cle 

Most regulatory RNA elements affecting mRNA half-life 
and translation have been detected in the 3 / -UTR in try- 
panosomatid genes (5). To identify c/s-acting elements re- 
sponsible for the observed regulation along the growth 
curve, we tested the 5 / -UTR and/or the 3 / -UTR of NT8 
mRNA into a luciferase reporter transgene that was inte- 
grated to form stable cell lines. As a control, we used a cell 
line transfected with plasmid pGR108, which expressed lu- 
ciferase under the control of the EP1 procyclin 5 / -UTR and 
the actin 3 / -UTR [(20), Supplementary Figure SI]. In this 
cell line, neither luciferase mRNA nor activity was differ- 
entially regulated during the growth curve (Figure 5). In 
all experiments described in this work, luciferase mRNA 
and activity levels in reporter transgenes were normalized 
to those of actin 3 / -UTR in early log phase set at 100%. 
When the NT8 3 / -UTR was fused to luciferase, a 5-fold de- 
crease in transcript levels was observed in early log phase 
as compared with the control cell line. Moreover, luciferase 
mRNA levels increased 4-fold during late log phase and de- 
creased again 2 h upon addition of guanosine to the cul- 
ture medium (Figure 5A). Replacement of the EP1 5 / -UTR 
with the NT8 5 / -UTR did not have any apparent effect on 
mRNA abundance (Figure 5A). At the protein level, the 
presence of NT8 3 / -UTR caused an ~50-fold reduction of 
luciferase activity during early log phase; the activity in- 
creased 9-fold upon entering in late log phase (Figure 5B). 
When luciferase was expressed under the control of both 
the 5'- and 3 / -UTRs of NT8, protein levels were 4- to 5-fold 
higher than those detected when using the 3 / -UTR alone in 
both growth phases (Figure 5B). This suggested that the 5'- 
UTR of NT8 enhances translation. Indeed, when luciferase 
was expressed under the control of the NT8 5 / -UTR alone, 
an ~3-fold increase in luciferase activity was observed (Fig- 
ure 5B), suggesting that the 5 / -UTR of NT8 contains RNA 
elements that stimulate translation regardless of the growth 
phase. The results also show that the 3 / -UTR of NT8 con- 
tains regulatory RNA elements that confer both repression 
of gene expression during early log phase and responsive- 
ness to purine levels in the medium. With this experimental 
design, we could not assess the effect of purines on luciferase 
activity, since there was no enough time for the protein to 
be turned over after the addition of guanosine (data not 
shown). This was probably due to the stability of luciferase 
in procyclic cells (33). Nevertheless, luciferase activity could 
be indeed modulated by purine levels (see below). 
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Figure 6. Identification of RNA regulatory elements within the 3'-UTR of NT8. (A) Serial deletions of the NT8 3'-UTR were generated starting from the 
5' end and fused to a luciferase gene with an EP1 5'-UTR. (B) Luciferase activity was normalized to that of a cell line expressing the reporter under the 
control of EP1 5'-UTR and actin 3-UTR (dashed line). (C) Predicted secondary structure of the regulatory stem-loop identified in deletion A2. RNAfold 
plots corresponding to the entire 3'-UTR can be seen in Supplementary Figure S2. 



Identification of regulatory elements within the NT8 3'-UTR 

To identify RNA regulatory sequences in the 3 / -UTR of 
NT8, we performed serial deletions of the 3 / -UTR and fused 
them to a luciferase reporter gene (Figure 6A). Deletion of 
the first 203 nt did not have any apparent effect on luciferase 
activity (deletion Al, Figure 6B). However, when the first 
365 nt were removed, repression during early log phase was 
lost (deletion A2). Further deletions in the 3 / -UTR (dele- 
tions A3 and A4) did not have additional effects on ex- 
pression in comparison with those observed in deletion A2 
(Figure 6B). These results suggested that there is a regula- 
tory element located between nucleotides 204 and 365 in the 
3 / -UTR of NT8. Secondary RNA structure predictions us- 
ing RNAfold (Supplementary Figure S2A) revealed a 60-nt 
stem-loop structure with very high base-pairing probabili- 
ties, which is shown in detail in Figure 6C. Removal of just 
these 60 nt did not apparently alter the secondary structure 
of the 3 / -UTR (Supplementary Figure S2B) and had vir- 
tually the same effect as deletions A2 to A4 (Figure 6B). 
This suggests that the short stem-loop structure is respon- 
sible for most of the observed repression of gene expression 
during early log phase. There are three genes encoding NT8 
arranged in a tandem array in the T. brucei genome (Sup- 
plementary Figure SI), and all are predicted to contain the 
stem-loop within their 3 / -UTRs (data not shown). 



The stem-loop element represses gene expression in early log 
phase and confers purine-dependent regulation 

To study whether the RNA element was sufficient to provide 
regulation in a different gene context, we modified the lu- 
ciferase transgene construct to insert the stem-loop 149 nu- 
cleotides downstream the luciferase stop codon. As shown 
in Figure 7 A, luciferase activity decreased ~ 10-fold in early 
log phase when the regulatory element was placed into the 
wild-type actin 3 / -UTR. In a second experiment, the stem- 
loop was inserted within a 3 / -UTR derived from the mito- 
chondrial malate dehydrogenase (mMDH) gene, which was 
not normally regulated during growth cycle (Supplemen- 
tary Table SI). Cells expressing luciferase fused to wild-type 
mMDH 3 / -UTR showed an activity close to that observed 
in the actin control cell line (Figure 7A). However, the inser- 



tion of the stem-loop within the 3 / -UTR of mMDH resulted 
in a 20-fold decrease in luciferase activity in early log phase. 

We next studied whether the stem-loop could also confer 
regulation in response to extracellular purine levels. We used 
cell lines expressing transgenes encoding luciferase fused to: 
(i) wild-type NT8 3 / -UTR, or (ii) NT8 3 / -UTR without the 
stem-loop, or (iii) wild-type actin 3 / -UTR, or (iv) actin 3'- 
UTR containing the stem-loop (Figure 7B). Cells were har- 
vested in early log phase (time 0 h), washed and resuspended 
in purine-depleted medium. Cultures were then split into 
two, guanosine was added to one but not the other and lu- 
ciferase activity was measured after an incubation of 8 h. 
As shown in Figure 7B, luciferase expression was strongly 
repressed in a cell line expressing the reporter under the con- 
trol of an intact NT8 3 / -UTR when guanosine was present 
in the medium, and increased ~40-fold when guanosine was 
withdrawn. However, when the stem-loop element was ab- 
sent, no regulation could be observed regardless the pres- 
ence or absence of guanosine. Moreover, insertion of the 
stem-loop within the 3 / -UTR of the control actin gene ren- 
dered it responsive to guanosine, since luciferase expression 
was again inhibited in the presence of guanosine and en- 
hanced in depleted medium. All these results show that the 
stem-loop element is both necessary and sufficient to pro- 
mote repression of gene expression during early log phase 
in response to purine levels. 

We also tested whether a shorter version of the stem- 
loop was still able to repress luciferase activity and found 
that a hairpin as short as 35 nt (containing the sequence 
below the horizontal line in Figure 6C) had a similar ef- 
fect on luciferase expression in comparison with its 60- 
mer counterpart (Figure 7C). In addition, we generated dif- 
ferent mutant versions of the 35-mer stem-loop and ana- 
lyzed whether they could still mediate repression in early 
log phase (detailed sequences and predicted structures are 
available in Supplementary Figures S3 and S4). A point mu- 
tation within the loop predicted to disrupt the whole stem- 
loop structure (Supplementary Figure S3) resulted in a loss 
of regulation (Figure 7D). We also reduced the loop length 
to 3 nt and introduced structurally non-synonymous or syn- 
onymous mutations in the stem. Regulation was lost in all 
cases (Figure 7D), suggesting that probably both sequence 
and structural features are important. 
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Figure 7. Effect of the stem-loop element on gene expression in differ- 
ent gene contexts. (A) Cell lines were generated which expressed luciferase 
under the control of actin (ACT) or mitochondrial malate dehydrogenase 
(mMDH) 3'-UTRs in which the regulatory stem-loop was inserted. (B) 
Effect of guanosine withdrawal in early log cultures expressing luciferase 
under the control of 3'-UTRs containing or lacking the regulatory stem- 
loop. The actual values of relative luciferase activity are indicated above 
each bar for convenience. (C) Sequences of the short (35 mer) wild-type 
stem-loop and mutated versions thereof. Mutations in the stem are indi- 
cated by asterisks. Detailed sequences and structures can be seen in Supple- 
mentary Figures S3 and S4. (D) Effect of shorter and mutant stem-loop 
versions on luciferase expression. Luciferase activity was normalized to 
that of a cell line expressing the reporter under the control of EP1 5'-UTR 
and actin 3-UTR (dashed line). 



DISCUSSION 

We have used high-throughput sequencing to analyze how 
trypanosomes remodel their transcriptome when entering 
in late logarithmic phase, and thus obtain valuable infor- 



mation that could be used to identify novel cis- and trans- 
acting factors involved in post-transcriptional regulation. 
We found that the abundance of ~200 transcripts was al- 
tered during the growth cycle. A previous study using mi- 
croarrays compared the transcriptome of trypanosomes in 
logarithmic phase with that of non-dividing cells in station- 
ary phase (34). These authors identified a set of ~900 regu- 
lated mRNAs with no obvious functional relationship. The 
larger number of regulated genes in their study could be ex- 
plained if one takes into account the general decline in all 
cell growth functions that occurs in stationary phase. 

The most upregulated mRNA in late log phase encoded 
the high-affinity purine transporter NT8, not previously 
identified as upregulated (34). Since trypanosomes are un- 
able to synthesize purines de novo (35), it is conceivable 
that they need to stimulate the transport of nucleobases 
when purines become scarce as cell density increases. In- 
deed, it has been shown that trypanosomes and Leishmania 
respond to purine starvation by increasing their transport 
capacity (36,37). Interestingly, we found that the addition 
of guanosine to late log cultures reversed the accumulation 
of NT8 mRNA and increased the abundance of an oth- 
erwise downregulated mRNA. Moreover, we showed that 
~140 transcripts were altered in abundance in guanosine- 
supplemented medium. Thus, purines act as an extracellu- 
lar cue that controls gene expression in trypanosomes, as it 
has been recently reported in Leishmania (38). 

We identified a short RNA element within NT8 that re- 
presses the expression of this transcript during early log 
phase. It contains a predicted stem structure consisting of 
a stretch of purines paired with pyrimidines in the oppo- 
site strand and a loop sequence (Figure 6C). Similar purine- 
pyrimidine double-stranded structures followed by regions 
of unpaired nucleotides could also be detected in the 3'- 
UTRs of other regulated transcripts, although there seems 
to be no conservation at the sequence level (Supplementary 
Figure S5). In contrast to other ds-acting motifs described 
so far in trypanosomatids, the stem-loop described in this 
work is sufficient to confer regulated gene expression when 
placed in different contexts, that is, within the 3 / -UTR of 
transcripts that are not regulated during the growth cycle. 

The RNA element identified in this work is able to 
provide regulation in a purine-dependent manner. In two 
species of the related trypanosomatid Leishmania, purine 
starvation triggers the transition toward metacyclic forms, 
the stage responsible for the initiation of infection in the ver- 
tebrate host (39). It is therefore conceivable that the same 
scenario applies to T. brucei and purine depletion triggers 
differentiation to mesocyclic or other developmental stages 
(40,41). In addition, in vitro cultures of trypanosomes in 
late log phase could be mimicking the environment faced by 
procyclic cells in the midgut of the insect vector. Thus, the 
results presented here could be useful to understand how 
trypanosomes regulate gene expression in order to survive 
in the different environments. 

How does the RNA element function? There are two 
possible mechanisms: either direct binding of nucleotide or 
via trans-acting factors. Bacterial riboswitches are RNA re- 
gions that regulate gene expression by adopting different 
conformations upon binding to specific metabolites (42). 
There is no resemblance of the RNA element described 
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here to bacterial purine riboswitches, which are longer in 
sequence and more complex in structure (43). However, a 
34-mer bacterial riboswitch consisting only of a stem-loop 
structure and a short tail has been described which selec- 
tively binds to the nucleoside queuosine (44). Alternatively, 
specific RNA-binding proteins could bind to the regula- 
tory RNA element in a purine-dependent manner in try- 
panosomes. For example, a signaling cascade could mod- 
ify an RNA-binding protein via phosphorylation, methyla- 
tion or other post-translational modification, which in turn 
would alter the binding capacity of the protein depending 
on purine levels. The actual mechanism remains to be deter- 
mined. Our work has uncovered an unprecedented mech- 
anism by which extracellular purines are able to control 
gene expression at the RNA level in eukaryotes, and pro- 
vides an interesting venue for the discovery of novel regula- 
tory molecules and mechanisms in the post-transcriptional 
world. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 
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